Rank in Wordlist | Frequency | Word |
---|---|---|
2630 | 5 | 100,000 |
3816 | 3 | 144,000 |
3836 | 3 | 7,000 |
4950 | 2 | 1,000 |
5004 | 2 | 3,000 |
5005 | 2 | 3,500 |
5011 | 2 | 4,000 |
5021 | 2 | 600,000 |
5026 | 2 | 8,000 |
6074 | 2 | ene,’ |
Rank in Wordlist | Frequency | Word |
---|---|---|
7438 | 1 | 1963). |
Rank in Wordlist | Frequency | Word |
---|---|---|
4949 | 2 | %%? |
Rank in Wordlist | Frequency | Word |
---|---|---|
4083 | 3 | People's |
5172 | 2 | Côte d'Ivoire |
5472 | 2 | N'kisi |
5956 | 2 | d'Ivoire |
6303 | 2 | kuɾi'tibɐ |
8785 | 1 | K'tuvim |
8992 | 1 | M'lakhim |
9317 | 1 | N'khemya |
9318 | 1 | N'viim |
9353 | 1 | Nevi'im |
Rank in Wordlist | Frequency | Word |
---|---|---|
7708 | 1 | Abɔtre/Abɔtri |
14156 | 1 | www.jw.org/ee. |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
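The check described above can be sketched in a few lines of Python. This is only an illustration, not the tooling behind this report: the function name `flag_suspicious`, the `allowed` set, and the frequency threshold `min_freq` are assumptions, and the sample entries are copied from the tables in this section.

```python
# Special characters examined in this subsection.
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(words, allowed, min_freq=2):
    """Return (word, freq, offending_chars) for every word that contains a
    special character outside `allowed` and occurs at least `min_freq` times.

    `allowed` and `min_freq` are hypothetical, language-dependent settings.
    """
    flagged = []
    for word, freq in words:
        bad = sorted((set(word) & SPECIAL_CHARS) - set(allowed))
        if bad and freq >= min_freq:
            flagged.append((word, freq, "".join(bad)))
    return flagged


# (word, frequency) pairs taken from the tables in this section.
sample = [
    ("100,000", 5),
    ("People's", 3),
    ("%%?", 2),
    ("1963).", 1),
    ("Abɔtre/Abɔtri", 1),
]

# Assumption for this example: apostrophes and commas are acceptable
# inside words, everything else in SPECIAL_CHARS is not.
result = flag_suspicious(sample, allowed="',", min_freq=2)
print(result)
```

Under these assumed settings only `%%?` is reported. The entries containing `)` or `/` have frequency 1 and fall below the threshold, which matches the idea that very rare offenders are tolerable.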
Words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
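The same query pattern can be repeated for each special character. The sketch below uses Python's `sqlite3` module with an in-memory table so that it runs on its own. SQLite is an assumption here; the database behind this report may differ, and SQLite needs an explicit `ESCAPE` clause where the query above relies on the backslash directly. The helper name `words_containing` and the toy rows are hypothetical. The `w_id` values are derived from the ranks shown in the tables (rank + 100).

```python
import sqlite3


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words that contain `char`.

    LIKE wildcards (% and _) are escaped so they match literally.
    """
    escaped = char.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    pattern = f"%{escaped}%"
    query = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(query, (pattern, limit)).fetchall()


# Toy table for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (50, "%", 9),          # hypothetical row with w_id <= 100, skipped by the query
        (2730, "100,000", 5),  # rank 2630 in the tables above
        (5049, "%%?", 2),      # rank 4949
        (7538, "1963).", 1),   # rank 7438
    ],
)

for ch in ",()%&$\"'+*=/_":
    rows = words_containing(conn, ch)
    if rows:
        print(ch, rows)
```

Passing the pattern as a bound parameter avoids quoting problems for characters such as `"` and `'`, which would otherwise have to be escaped by hand in the SQL string.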
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots